Incremental speaker adaptation with minimum error discriminative training for speaker identification
نویسندگان
چکیده
Minimum Classification Error (MCE) has shown to be effective in improving the performance of a speaker identification system [1]. However, there are still problems to solve, such as the variability of the voice characteristics of a particular speaker through time. In this work, we analyze the degradation of a GMM-based textindependent speaker identification system when using test data recorded over 6 months after the training session. And trying to avoid this degradation we study the use of supervised adaptation based on Maximum a Posteriori (MAP), and MCE. These techniques have been shown to provide good results for speaker adaptation in speech recognition. The major result we have obtained is that by starting with GMM models trained with only speech from session 1, similar identification results can be obtained for all the other sessions using an incremental adaptation using only 2.5 seconds of speech per speaker and session as data for the MCE training adaptation procedure. We have also found that, in our extreme experimental setup, MAP becomes unhelpful when combined with MCE adaptation.
منابع مشابه
Incremental Speaker Adaptation with Minimum Error Discriminative Training for Speaker Identification
Minimum Classification Error (MCE) has shown to be effective in improving the performance of a speaker identification system [1]. However, there are still problems to solve, such as the variability of the voice characteristics of a particular speaker through time. In this work, we analyze the degradation of a GMM-based textindependent speaker identification system when using test data recorded ...
متن کاملMinimum phone error discriminative training for Mandarin Chinese speaker adaptation
Speaker adaptation is an efficient way to model a new speaker from an existing speaker-independent model with limited speaker-dependent data. In this paper, we investigate the use of discriminative training schemes based on the minimum phone error (MPE) criterion to improve a well-known speaker adaptation technique, a combination of transform-based adaptation and Bayesian adaptation. Furthermor...
متن کاملDiscriminative MCE-based speaker adaptation of acoustic models for a spoken lecture processing task
This paper investigates the use of minimum classification error (MCE) training in conjunction with speaker adaptation for the large vocabulary speech recognition task of lecture transcription. Emphasis is placed on the case of supervised adaptation, though an examination of the unsupervised case is also conducted. This work builds upon our previous work using MCE training to construct speaker i...
متن کاملComparison of ML and DT speaker adaptation methods
In this paper, we study how discriminative and Maximum Likelihood (ML) techniques should be combined in order to maximize the recognition accuracy of a speaker-independent Automatic Speech Recognition (ASR) system that includes speaker adaptation. We compare two training approaches for speaker-independent case and examine how well they perform together with four different speaker adaptation sch...
متن کاملComparison of discriminative training methods for speaker verification
The maximum likelihood estimation (MLE) and Bayesian maximum a-posteriori (MAP) adaptation methods for Gaussian mixture models (GMM) have proven to be effective and efficient for speaker verification, even though each speaker model is trained using only his own training utterances. Discriminative criteria aim at increasing discriminability by using out-of-class data. In this paper, we consider ...
متن کامل